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Abstract. An important property of high-performance, low complexity codes 
is the existence of highly efficient algorithms for their decoding. Many of the 
most efficient, recent graph-based algorithms, e.g. message passing algorithms 
and decoding based on linear programming, crucially depend on the efficient 
representation of a code in a graphical model. In order to understand the 
performance of these algorithms, we argue for the characterization of codes in 
terms of a so called fundamental cone in Euclidean space which is a function of 
a given parity check matrix of a code, rather than of the code itself. We give a 
number of properties of this fundamental cone derived from its connection to 
unramified covers of the graphical models on which the decoding algorithms 
operate. For the class of cycle codes, these developments naturally lead to a 
characterization of the fundamental polytope as the Newton polytope of the 
Hashimoto edge zeta function of the underlying graph. 



1. Introduction and Background 

Whenever information is transmitted across a channel, we have to ensure its 
integrity against errors. While data may originate in a multitude of applications, 
at some core level of the communication system, it is usually encoded as a string of 
zeros and ones of fixed length. Protection against transmission errors is provided 
by intelligently adding redundant bits to the information symbols, thus effectively 
restricting the set of possibly transmitted sequences of bits to a fraction of all 
possible sequences. The set of all possibly transmitted data vectors is called a code, 
and the elements are called codewords. A classical measure of goodness of a code is 
the code's minimum Hamming distance, i.e., the minimum number of coordinates 
in which any two distinct codewords differ. In fact, a large part of traditional 
coding theory is concerned with finding the fundamental trade-offs between three 
parameters: the length of the code, the number of codewords in the code, and the 
minimum distance of the code. 

It is well-known that the minimum Hamming distance c? of a code reflects its 
guaranteed error-correcting capability in the sense that any error pattern of weight 
at most [^^J can be corrected. However, most codes can, with high probability. 



Date: February 1, 2008. 

2000 Mathematics Subject Classification. Primary 94B; Secondary 05C. 

Key words and phrases, low-density parity-check codes, pseudo-codewords, graph covers. 

The first author was supported in part by NSF Grant #CCR-9984515. 

The second author was supported in part by NSA Grant #MDA904-03-l-0069 and NSF Grant 
#DMS-0457574. 

The third author was supported in part by NSF Grants #CCR 99-84515, #CCR 01-05719, 
and #ATM-0296033 and by DOE SciDAC and ONR Grant #N00014-00-l-0966. 
The fourth author was supported in part by NSF Grant #DMS-0302024. 

1 



2 



R. KOETTER, W.-C. W. LI, P. O. VONTOBEL, AND ,7. L. WALKER 



correct error patterns of substantially higher weight. This insight is the cornerstone 
of modern coding theory which attempts to capitalize on the full correction capa- 
bility of a code. One of the most successful realizations of this phenomenon is found 
in binary low-density parity-check (LDPC) codes. These codes come equipped with 
an iterative message-passing algorithm to be performed at the receiver's end which 
is extremely efficient and corrects, with high probability, many more error patterns 
than guaranteed by the minimum distance. 

In this situation, we are left with the problem of finding new mathematically 
precise concepts that can take over the role of minimum Hamming distance for 
such high performance codes. One of the main contributions of this paper is the 
identification of such a concept, namely, the fundamental cone 6_ of a code. Inter- 
estingly, the same cone appears when one is considering low-complexity decoding 
approaches based on solving relaxations of linear programs for maximum-likelihood 
decoding [2]. We give here a brief motivation of the concept. 

As a binary linear code, an LDPC code C is defined by a parity-check matrix H. 
The strength of the iterative decoding algorithm, i.e., its low complexity, comes from 
the fact that the algorithm operates locally on a so-called Tanner graph representing 
the matrix H . However, this same fact also leads to a fundamental weakness of the 
algorithm: because it acts locally, the algorithm cannot distinguish if it is acting on 
the graph itself or on some finite unramified cover of the graph. This leads to the 
notion of pseudo-codewords, which arise from codewords in codes corresponding to 
the covers and which compromise the decoder. Thus to understand the performance 
of LDPC codes, we must understand the graph covers and the codes corresponding 
to them. As will be seen later in the paper, this is tantamount to understanding 
a cone in R" defined by inequalities arising from H, called the fundamental cone. 
We show that the pseudo-codewords of C (with respect to H and the associated 
Tanner graph) are precisely the integral points in the cone which, modulo 2, reduce 
to the codewords of C. 

We emphasize below a few properties of the fundamental cone which appear to be 
central to a crisp mathematical characterization. A recurring theme is that these 
properties depend upon the representation of the code as the kernel of a given 
parity-check matrix, and not solely upon the code itself as a vector space. This 
showcases the modern viewpoint of coding theory: whereas, classically, the quality 
of a code was measured in terms of properties (e.g., length, dimension, minimum 
distance) of the collection of codewords comprising the code, the quality of a code 
is now measured in terms of properties (e.g., existence of pseudo-codewords of 
small weight) of a particular representation of the code. Thus, from the modern, 
algorithmic point of view, a given collection of codewords might be described by 
two different parity check matrices, one of which might be considered to be very 
good while another would be very bad. 

• The fundamental cone depends on the representation chosen for the code in 
terms of a parity-check matrix. Note that a linear code has many different 
parity-check matrices and hence many different cones. This reflects the 
property of message-passing algorithms that both the complexity and the 
performance are functions of the structure and, in particular, the sparsity 
of the parity-check matrix. 

• The fundamental cone is an essentially geometric concept relating only to 
the parity-check matrix and independent of the channel on which the code 



CHARACTERIZATIONS OF PSEUDO-CODEWORDS OF LDPC CODES 



3 



is employed. Thus we can study codes and their parity-check matrices 
independently of a specific application. 
• The fundamental cone has close ties with well-established mathematical 
objects. If the parity-check matrix is chosen to be the (highly redundant) 
matrix containing all words in the dual of the given code, it is readily 
identified as the metric cone of a binary matroid T, ch.27], and it is well- 
studied in this special case. Furthermore, for the particular class of LDPC 
codes called cycle codes, it is shown in that the fundamental cone is 
identified with the Newton polyhedron of Hashimoto's edge zeta Junction 
0] of the normal graph associated to the Tanner graph of the code. 

The last bullet above implies that the pseudo-codewords of a cycle code can be 
read off from the monomials occurring in the power series expansion of the asso- 
ciated zeta function. This gives another characterization of the pseudo-codewords 
for cycle codes. Inspired by this result, we draw an analogous connection between 
the pseudo-codewords of a general LDPC code (with respect to a given parity- 
check matrix), and the monomials of a certain type occurring in the power series 
expansion of the edge zeta function of the associated Tanner graph. 

In summary, we believe that the here-begun study of codes from the perspective 
of their efficient representation, as reflected in the fundamental polytope, holds the 
key to a thorough understanding of high performance codes and message-passing 
decoding algorithms. 

The remainder of this paper is organized as follows. In Sectional we give back- 
ground on LDPC codes and pseudo-codewords. Section |3| provides a technical yet 
crucial result about graph covers and their associated matrices. A characterization 
of pseudo-codewords in the general case via the fundamental cone is given in Sec- 
tional In Sectional we restrict our attention to the special case of cycle codes and 
draw the connection to Hashimoto's edge zeta function. We return to the general 
case in SectionlHl where we show that every LDPC code can be realized as a punc- 
tured subcode of a code of the type considered in the previous section. Using the 
results of Section (SJ we then characterize the pseudo-codewords in the general case. 

2. Low-Density Parity-Check Codes 
We begin with a definition. 

Definition 2.1. Any subspace C of is called a binary linear code of length n. 
If C is described as the null space of some matrix H , i.e., 

C = {c e F^' I Hc^ = 0} , 

then H is called a parity-check matrix for C . If H is sparse^, we call C a low-density 
parity-check (LDPC) code. 

Notice that the columns of H correspond to the coordinates, i.e., hits, of the 
codewords of C, and the rows of H give relations, i.e., checks, that these coordinates 
must satisfy. Although every code has many parity-check matrices, we will always 
fix a parity-check matrix H for each code we discuss. 

^The term "sparse" is necessarily vague, but typically one assumes that the number of I's 
in each column is much smaller than the number of rows. When considering a family of LDPC 
codes defined by a family {-ffi}i>o of X rii matrices with rii growing increasingly large but ri/rii 
remaining fixed, "sparse" means that the number of I's in the columns of the Hi is bounded by 
some constant. 
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The iterative decoding algorithms mentioned in Section ^ operate on a bipartite 
graph, caUcd the Tanner graph, associated to the matrix H. 

Definition 2.2. An undirected graph G = {V, E) consists of a set V of vertices and 
a coUection E of 2-subsets of V called edges. We say G has multiple edges if some 
2-subset {v,id} of V appears in E at least twice. We say two vertices v,w € V 
are adjacent if the set {v,w} is an edge. In this case, we say the edge {v,w} is 
incident to both v and w. For v & V, we write d{v) for the neighborhood of v, 
i.e., the collection of vertices of G which are adjacent to v. A bipartite graph with 
partitions A and B is an undirected graph G = (V, E) such that V can be written 
as a disjoint union V = AU B with no two vertices in A (resp., B) adjacent. 

We make the following conventions: Unless otherwise specified, our graphs will 
always be undirected and our bipartite graphs will never have multiple edges. 

Definition 2.3. Let C C F2 be the LDPC code determined by the (sparse) r x n 
matrix H = (hji). The Tanner graph T{H) is the bipartite graph defined as 
follows. The vertex set consists of the bit nodes X — {xi, . . . ,Xn} and the check 
nodes F — {/i, . . . , /r}. The set {xi, fj} is an edge if and only if hji = 1. 

Notice that the bit nodes in the Tanner graph correspond to the columns of H, 
the check nodes correspond to the rows of H, and the edges record which bits are 
involved in which checks. In other words, the graph T{H) records the matrix H , 
and hence the code C, graphically: a binary assignment of the bit nodes (ci, . . . , c„) 
is a codeword in C if and only if the binary sum of the values at the neighbors of 
each check node is zero. Because we have fixed a parity-check matrix H for G from 
the start, we will also refer to T = T{H) as the Tanner graph of the code G. 

Example 2.4. Let G be the binary linear code of length 7 with parity-check matrix 
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Two representations of the Tanner graph T = T{H) associated to H are given 
in Figure ^ where bit and check nodes are represented by empty circles and filled 

Xi 

Xi 

X3 

Xi 
Xr, 
Xe 



Figure 1 . Two representations of the Tanner graph for the code 
G of Example in 



H = 



(1 


1 

















1 


1 


1 








1 





1 




















1 


1 

















1 


1 
















1 




CHARACTERIZATIONS OF PSEUDO-CODEWORDS OF LDPC CODES 



5 



squares, respectively. The graph on the left is a traditional rendering of a bipartite 
graph, but the one on the right is easier to work with. The vector 

C = (ci,C2,C3,C4,C5,C6,C7) := (1,1,1,0,0,0,0) 

is a codeword in C. This can be checked either by computing Hc^ or by assigning 
the value Ci to each bit node Xi in T and verifying that the binary sum of the values 
at the neighbors of each check node fj in T is zero. A 

Any iterative message-passing decoding algorithm, roughly speaking, operates 
as follows; see for a more precise description. A received binary word gives an 
assignment of or 1 together with a reliability value at each of the bit nodes on the 
Tanner graph. Each bit node then broadcasts this bit assignment and reliability 
value to its neighboring check nodes. Next, each check node makes new estimates 
based on what it has received from the bit nodes and sends these estimates back to 
its neighboring bit nodes. By iterating this procedure, one expects a codeword to 
emerge quickly. Notice that the algorithm is acting locally, i.e., at any stage of the 
algorithm, the decision made at each vertex is based on information coming only 
from the neighbors of this vertex. It is this property of the algorithm which causes 
both its greatest strength (speed) and its greatest weakness (non-optimality) . In 
order to quantify this weakness, we will need another definition. 

Definition 2.5. An unramified, finite cover, or, simply, a cover of a graph G = 
{V, E) is a graph G — {V , E) along with a surjection tt : V ^ V which is a graph 
homomorphism (i.e., tt takes adjacent vertices of G to adjacent vertices of G) such 
that for each vertex v G V and each v £ tt~^{v), the neighborhood d{v) of v is 
mapped bijectively to d{v). A cover is called an M -cover, where M is a positive 
integer, if |7r^"'^(u)| — M for every vertex v in V. 

Example 2.6. We return to the code G with chosen parity-check matrix H of 
Example 12.41 the corresponding Tanner graph T = T{H) was given in Figure Q 
An example of a 2-cover (or double- cover) T of T is given in Figure[21 The bipartite 
graph T is the Tanner graph for a code C of length 14 = 2 • 7. 




Figure 2 . A 2-cover of the code G from Example l2.4l as described 
in Example 12.61 
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The parity-check matrix H for the code C associated to T is the 12 x 14 matrix 
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where 

^ '^0 ly ' " ^ \ i oy ' "~ \ Q 0^ 

the rows are ordered to correspond to the check nodes /(i,i), /(i,2)j • • ■ i /(6,i)j /(6,2)j 
and the columns are ordered to correspond to the bit nodes X(i_2), ■ ■ ■ , ^^(y^i), 

X[7,2)- A 

Suppose T is a Tanner graph for the binary hnear code C C F2 and T is an 
M-cover of T for some M > 1. Let C C Fj*^ be the binary hnear code determined 
by T. To indicate that the coordinates of F2''^ are ordered as in Example 12.61 with 
each successive block of M coordinates lying above a single coordinate of Fj , we 

nl 
2 

(a;(i_i): . . . :a;(i,j\/), • ■ • , ^^(n,!): • ■ • '■X(n,M)] 



will write an element x of F? as 



Every codeword in C yields a codeword in C by "lifting" : if c = (ci, . . . , c„) is in 
C, then the vector 

c = (c(i,i): • • ■ :c(i,Af), • ■ • , C(„4): . . . :C(„^m)), 

where C(i^fe) = Ci for 1 < i < n and 1 < fc < A/, is in C . However, there are also 
codewords in C which are not liftings of codewords in C . 

Example 2.7. Once again, let C be the code of Examples 12.41 and 12. til and let 

C be the code corresponding to the double-cover T of the Tanner graph T for C, 
as in Example 12.61 The codeword c = (1, 1, 1, 0, 0, 0, 0) of C lifts to the codeword 
c = (1:1, 1:1, 1:1,0:0,0:0,0:0,0:0) of C. Ahhough it is easily checked that the vector 

a:= (1:0,1:0,1:0,1:1,1:0,1:0,1:0) 

is a codeword in C, it is certainly not a lifting of any codeword in C . A 

Notice that, in general, if 

a = • • • :a(i,Af), • • ■ , a(Ti,i): • • • '■o.(n,M)) 

is a codeword in the code corresponding to some M-cover T of T, then for any 
permutations (Ti, . . . , cr„ on {1, . . . , M}, there is an Af-cover T' of T such that 

a' = (a(i,<Ji(i)): ■ • ■ '■a(i,cn(M)), • • ■ , a(n,(T„(i)): • ■ ■ :a(n,c7„(A/))) 

is a codeword in the code corresponding to T' . This motivates the next definition. 

Definition 2.8. Let C C F2 be a binary linear code with Tanner graph T and let 

^~ : • • • :a(i,M) I • • ■ I : ■ • • ■^(n.M) ) 

be a codeword in the code C corresponding to some Af-cover T of T. The unsealed 
pseudo-codeword corresponding to a is the vector p(a) := (pi, . . . ,p„) where, for 
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1 < i < n, Pi is the number of nonzero a(^i i^^, 1 < k < M. The normalized pseudo- 
codeword corresponding to a is the vector a; (a) = {lui, . . . ,LUn) where each LUi is a 
rational number, < iVi < 1, given by uji := jjPi for 1 < « < M . 

Example 2.9. The unsealed pseudo-codeword corresponding to the codeword a 
on the 2-cover of Example 12.71 is (1,1,1,2,1,1,1). The corresponding normalized 
pseudo-codeword is (i, i, i, 1, i, i, i). A 

Notice that if c is a codeword in our original code and c is the lifting of this 
codeword to the code corresponding to some finite cover of the Tanner graph, then 
ijj{c) = c. Indeed, the entries of a normalized pseudo-codeword will be entirely O's 
and I's if and only if it comes from the lifting of some actual codeword. Otherwise, 
there will be at least one entry which is non-integral. 

The key issue with graph covers is that locally, any cover of a graph looks exactly 
like the original graph. Thus, the fact that the iterative message-passing decoding 
algorithm is operating locally on the Tanner graph T = T{H) means that the 
algorithm cannot distinguish between the code defined by T and any of the codes 
defined by finite covers of T. This implies that all the codewords in all the covers 
are competing to be the best explanation of the received vector. 

To make this more precise, we assume for simplicity that we are operating on 
the binary symmetric channel; the situation for other channels is similar (see jBj). 
Under this assumption, a transmitted bit is received correctly with probability 1 — e 
and incorrectly with probability e where < e < ^. 

The goal of any decoder is to find the codeword c G C C F2 that best explains 
(in some sense) the received vector y G F2. For the binary symmetric channel, a 
maximum likelihood decoder will find the codeword which is closest in Hamming 
distance to y. On the other hand, because the iterative decoder of an LDPC code 
acts locally on the Tanner graph associated to the code, it allows all codewords 
from all finite covers to compete to be the best explanation of y. In a sense, it 
automatically lifts y to vectors y G Fj*^ for every AI > 1 and searches for a 
codeword a in some code C C Fj corresponding to some AI -cover of the Tanner 
graph, for some AI > 1, such that jj times the Hamming distance from (the 
appropriate) y to a is minimal among all codewords in all codes corresponding to 
all finite covers of the Tanner graph. Note that even if fewer than [^^J errors 
have occurred (where d — (imin(C') is the minimum Hamming distance of the code), 
there may be codewords in covers which are at least as close, in this sense, to y as 
is the unique closest codeword. 

Example 2.10. Consider again the code C from Examples 12.41 TTM and 12.71 As- 
sume that wc are transmitting over a binary symmetric channel and we receive the 
vector y = (1,0,1,1,0,1,0). 

One can check that the codeword c = (1, 1, 1, 0, 0, 0, 0) satisfies d{y, c) = 3 and 
that the Hamming distance from y to any other codeword in C is larger than 3. 
Therefore, a maximum-likelihood decoder would output the codeword c when y is 
received. 

But the iterative message-passing decoding algorithm allows all the codewords 
in all the codes corresponding to all the finite covers to compete. In particular, 
the vector a = (1:0, 1:0, 1:0, 1:1, 1:0, 1:0, 1:0) from Example \Tk lies in the code C 
corresponding to the double cover T of T and is hence a competitor. Letting 
y = (1:1,0:0,1:1,1:1,0:0,1:1,0:0) be the lifting of y to F^'', we see that ^ times 
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the Hamming distance from y to a is also 3. Hence a is just as attractive to the 
iterative decoder as c is. The iterative decoder becomes confused. A 

The situation observed in Example 12.101 happens in general: one can easily ex- 
hibit a received vector y and a codeword a in an Af-cover for some M such that -jj 
times the distance from y to a is at most (i(y, c) for any codeword c in the original 
code. As mentioned above, there is nothing special about the binary symmetric 
channel, and so the above statements can easily be generalized to other channels. 

Thus, in order to understand iterative decoding algorithms, it is crucial to un- 
derstand the codewords in the codes corresponding to all finite covers of T{H). The 
remainder of this paper is devoted to this task. 

3. Liftings 

We saw in Section|21above that understanding finite covers of graphs is crucial to 
understanding the performance of the iterative decoding algorithm used for LDPC 
codes. The main result of this section. Theorem 13. 31 will help us to reach this goal. 
Though it is rather technical, the remainder of the paper hinges upon it. 

We first state a lemma, the proof of which follows immediately from the definition 
of an M-cover (Definition 12.51) . 

Lemma 3.1. Let H = (hji) be the parity-check matrix associated to the Tanner 
graph T and let T he an M-cover of T . Let H = 1 < k < M and 

1 < I < M , be the parity-check matrix associated to T . Then for each i and j , there 
is a permutation aji on {1, . . . , M} such that h^j i) (^i ]^-) — 1 if and only if hji — 1 
and o'ji{l) — k. Conversely, choosing permutations aji on {1, . . . , M} for all i and 
j uniquely and completely determines an M-cover T of T and its corresponding 
parity- check matrix H . 

A simple interpretation of Lemma [3. II is that if H has associated Tanner graph 
T and T is an M-cover of T, then the matrix H associated to T can be obtained 
by replacing each oi H with an M x M matrix of O's and each 1 oi H with a 
suitably chosen M x M permutation matrix. We need one more definition before 
we can state the main result of this section. 

Definition 3.2. Let G = {V,E) be a graph. Fix an ordering of the edges, so 
that we have E = {ei, . . . , e„}. A sequence of edges {ei^ , . . . ,ei^) of G is a path 
on G if the edges e^^ can be directed so that e^^ terminates where ei^_^-^ begins for 
1 < s < fc — 1. We say the path is backtrackless if for no s do we have e^^ = ^is+i- 
We say two paths are edge-disjoint if they do not share an edge. 

The next theorem is the main result of this section. It gives conditions under 
which a collection of edges, with multiplicities, on a graph may be lifted to a union 
of edge-disjoint paths on some finite cover of the graph. It will be used in Section^ 
to show that every vector of nonnegative integers which lies in the fundamental 
cone and which reduces modulo 2 to a codeword must be a pseudo-codeword, and 
that result will be used in turn in Section to characterize pseudo-codewords in 
the case in which all bit nodes in the Tanner graph have even degree. The proof is 
constructive, providing an algorithm to produce the desired paths. 

Tiieorem 3.3. Let T ~ (XU-F, E) be a bipartite graph. Suppose that to each e Cz E 
there is assigned a nonnegative integer me such that 
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(H.l) For each x ^ X, there is a nonnegative integer Mx such that me = Mx for 

every edge e incident to x. 
(H.2) For each f & F, the sum ^ Mx is even. 

xed(f) 

(H.3) For each f G F and each x e d{f), we have My > Mx- 

vedU)\{x} 

Then there is a finite cover it : T := (XUF, E) ^ T and a union A := Ai U- • • U Ap 
of backtrackless paths Aj on T such that the endpoints of each Aj are in X and 
such that 

(C.l) Each f G F occurs in A at most once. 
(C.2) Each e € E occurs in A at most once. 

(C.3) At each x € X, either all or none of the edges incident to x in T occur in 
A. 

(C.4) For each e G E, we have |7r~^(e) fl A| = mg. 

Proof. We will refer to X as the set of bit nodes of T and to F as the set of check 
nodes of T. Let F be the multiset of edges of T which contains, for each e G a 
total of rUe copies of e. For each f G F, let Nj be the number of edges in T which 
are incident to /, counted with multiplicity. In other words, 

xedif) 

Set M := max {{Mx \xGX}U {^Nf \f gF}). We construct an M-cover tt : T 
T and the desired A explicitly. The vertex set of T is 

{a;fe I a; e X and 1 < fc < M} U {/, I / e F and 1 < ; < M} 

and the map tt : T — > T is given by 7r(xfe) = x, n{fi) = f. We now need to describe 
the edges of T and the disjoint paths Aj. We will first describe the edges of T 
which are involved in the A^'s, and then we will describe the remaining edges of T. 
The bit nodes of T involved in the Aj's are {xk \ x G X and I < k < Mx} and the 
check nodes of T involved in the Aj's are {fi\fGF and 1 < I < ^Nf}. 

Start by writing out, for each x G X, Mx copies of the list d{x) of neighbors 
of x; label these lists using the bit nodes Xi, . . .Xm^ of T lying above x so that 
L{xi) . . . , L{xm^) are the Mx copies of d{x). Notice that there is a 1-1 correspon- 
dence between the edges in T (with multiplicity) and pairs [xk , /) where / occurs 
in L{xk). Similarly, write out, for each f G F, one copy of the list d{f) of neighbors 
of /, but then replace each x appearing in the list with the bit nodes xi, . . . ,xm^ 
of T so that the list has length Nf, call this list L{f). Again, we have a 1-1 corre- 
spondence between the edges in T (with multiplicity) and the pairs {f,Xk), where 
Xk occurs in L{f). We will construct the Aj's one vertex at a time. Each time 
we add a vertex (except for the initial vertex of each A^), we arc choosing an edge 
from r and lifting it to T, and so we will cross one check node off a list labeled by 
a bit node and one bit node off a list labeled by a check node. Thus the lists L{xk) 
and L{f) change as the algorithm proceeds. 

We will need some terminology and notation in the construction: 

• At any given point in the algorithm and for any vertex v, let the current 
weight of v be the number of elements in L{v). 
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• At any given point in the algorithm and for x ^ X and f & F, set mf{x) := 

Notice that since, as mentioned above, the hsts L{v) change as the algorithm pro- 
ceeds, the current weight of a vertex and the value mf{x) for x G X and f £ F 
do as well. At the beginning, the current weight of Xk {x € X and 1 < fc < Mx) 
is \d{x)\, the current weight of f E F is Nf, and mf{x) = if / £ d{x) and 
otherwise. To construct the A^'s which form A, we proceed as follows: 

(1) Choose a bit node of T whose current weight is at least that of every other 
bit node of T and take it to be the first vertex in a path P. 

(2) Suppose we have just added the bit node Xk to P, where x E X and 
1 < fc < Mx, and that L{xk) 7^ 0. Choose a check node / G L{xk) such 
that the current weight of / is at least that of any other check node in 
L{xk)- Write down /s as the next vertex of P, where s is the number of 
times (including this one) that / has been used so far in all of A. Cross / 
off L{xk) and cross Xk off L{f). 

(3) Suppose we have just added the bit node Xk and then the check node fs 
to P, where xSX, / gP, l<fc< M^, and 1 < s < Nf. Let P(/) \ x 
denote the set of vertices in P(/) which are not of the form Xi for any i; 
ClaimQlbelow shows that L{f) \ x \s nonempty. Let yi G P(/) \ a: be such 
that mf{y) > mf{w) for all w such that Wt G P(/) \ x for some t and the 
current weight of yi is at least that of any other yi G L{f). Append the 
vertex yi to P. Cross yi off L{f ) and / off L{yi). If L{yi) is now empty, 
then P is complete and will be one of the A^'s in the disjoint union A. 
Otherwise, return to Step 

(4) If there are nonempty lists remaining, start over with Step on the re- 
maining set of vertices. Otherwise, A is the union of the A^'s and the 
algorithm is complete. 

It is now clear from the construction and hypothesis (HQ that A = Ai U • • • U Ap 
is a union of paths satisfying conditions (C^l, (C[2Jl and (C0J). Claimnbelow shows 
that each A,; is backtrackless, and hypothesis implies that the ending vertices 

must be bit nodes since the starting vertices are. To see that condition (COJ holds, 
let a: G X and consider two cases. If fc > M^, then Xk is not involved in A and so 
no edge incident to Xk occurs in A. If 1 < fc < M^, we have \d{x)\ edges incident 
to Xk involved in A. Since the degree of Xk in T is to be the same as the degree of 
X in T, we have that all edges which are to be incident to Xk in T occur already in 
A. 

All that remains is to add additional edges to T so that tt : T — > T is an M-cover. 
In order for tt : T — > T to be an M-cover of T, we must have, for each bit node x 
of T and each fc with 1 < fc < M, 

\d{xk)\ = \d{x)\ 

and 

d{xk) for some 1} = d{x). 

Let X X. As mentioned above, these properties hold already for Xk with 
1 < fc < Mx, and we have constructed no edges involving the M — M^ other bit 
nodes Xk of T. Similarly, for each check node / with / G d{x), we know that exactly 
Mx of the vertices /* are connected by an edge to some Xs , which means that there 
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are M — Mx vertices /; which are not connected by an edge to any Xs- We can 
pair up these M — Mx bit nodes Xk and these M — Mx check nodes /; any way we 
please. In particular, this will not change any bit nodes already involved in our A, 
and when we are done doing this for each x, we will have the Af-cover tt : T ^ T 
and the A we seek. 

The proof of Theorem 13.31 will be complete once we have proven Claims 

Claim 1. Steps ^ and © can always be performed without introducing a back- 
track. In particular, the set L{f) \ a; in Step lUJ of the algorithm is nonempty. 

Proof. For each bit node w G X, let rrff(w) be the value of mf{w) at the start of 
Step |(2Jl and let m'j{w) be the value of mf{w) at the end of Step ©. For each 
w G X and each f E F, let P{w, /) denote the inequality 

yeX\{w} 

and let P{w,f) denote the inequality 

yeX\{w} 

Notice that at the start of the algorithm, P{w,f) is true for every w and / by 
hypothesis (R^. 

Suppose P{w, /) holds for every w and / and that we are at the start of Step jSJ, 
having just appended Xk to P. We will show that we can perform Steps (j^J and 
without introducing a backtrack, and that the inequalities P{w, /) will hold when 
we are done with these two steps. This will mean that we can continue to perform 
these steps until we are forced to move on to Step 

Since each check node occurs in L{xk) at most once, we know that L{xk) no 
longer contains the check node we appended to A just before we appended x^- So, 
since L{xk) is, by assumption, nonempty. Step 121 can be performed and it does 
not introduce a backtrack; let / be the check node appended to A in that step, so 
that m'j{x) — m^j:{x) — 1. Since P{x, /) held before Step ij^Jl, we know that there 
is at least one y ^ x such that iji G L{f) for some i, i.e., L{f) \ x is nonempty. 
So Step can be performed, and we have m'j{y) = m^jijj) — 1 for the y G X 
chosen in that step. For all other bit nodes w, we have rrfj{w) — rrfj{w). We now 
need to show that the inequality I°{w,f) holds for every w € X. First note that 
P{x,f) is obtained from P{x,f) by subtracting 1 from each side. Since P{x,f) 
held, I°{x,f ) must also. The same argument shows that I'^{y,f) holds. Further, 
7*^(w, /) holds whenever mj{w) = m^f{w) = since what appears on the left-hand 
side of P{w,f) is certainly nonnegative. Hence we need only show that P{w,f) 
holds for w £ X \ {x,y} with TOj(w) = TOj(w) > 1. 

So suppose m'j:{w) — my(w) > 1. Consider first the case where m'j{v) = 
rrfj{v) = for all v £ X \ {x, y, w}. Then the inequality /''(u;, /) says 

m'f{x) + ^^^{y) > m°f{w), i.e., m}(x) -I- mj(j/) - 2 > mj(w). 

If my{y) = m^j:(w), then, since mj(a;) -I- Trfj{y) + nf^{'w) is even, we know that 
rrffix) > 2 and so holds. Otherwise, we have nf^{y) > m^j{w) + 1 and so, 

since m^f{x) > 1, we again see that /) holds. 
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Now consider the case that there is at least one bit node v ^ X \ {x, y, w} with 
TOy:(ti) — m'jiv) > 1. Then it is enough to show that 

m'fix) + m^y) + m°f{v) > m'j{w). 

But this is the same as 

m^x) + m^y) + m^v) - 2 > m^w). 

Since rrff{y) > rrf^{w) and each of rrfj{x) and m^^{v) is at least 1, this latter 
inequality holds and so P{'w, f) does as well. □ 

This completes the proof of Theorem 13. 31 □ 
4. The Fundamental Cone 

The pseudo-codewords are described for general LDPC codes by the fundamental 
cone. 

Definition 4.1. Let H ~ (hji) be an r x n matrix with hji e {0, 1} for each j and 
i. The fundamental cone JC{H) of H is the set of vectors u — {vi, . . . , G M" 
such that, for all 1 < i < rt and 1 < j < r", we have 

(4.1) ly, > 
and 

(4.2) '^hjiiVi, > hjiUi. 

i' 

Remark 4.2. The matrices H we consider will be parity-check matrices of binary 
linear codes. As such, we will sometimes be doing computations over F2 (e.g., when 
deciding if a vector is a codeword) and sometimes over M (e.g., when deciding if a 
vector is in the fundamental cone). Although the field over which we are working 
should usually be clear from context, we will typically specify it explicitly to help 
avoid confusion. 

Example 4.3. The fundamental cone JC{H) for the parity-check matrix H of the 
code C from Example 12.41 is 

z/,- > for 1 < i < 7 ^ 



Vl=V2 = 1^3, Vz = Vq = V-j 



Notice that the unsealed pseudocodeword (1,1,1,2,1,1,1) and the normalized pseu- 
docodeword (i, i, i, 1, i, i, i) from Example 12. 91 he in K,{H). A 

The importance of the fundamental cone is illustrated below by Theorem 14.41 
CoroUarv 14.51 and Theorem 14. 61 

Tiieorem 4.4. Let H = {hji) be an r x n matrix with hji G {0, 1} for each j and 
i, K. — IC{H) the fundamental cone of H, and C the binary code with parity-check 
matrix H. Let p = (pi, . . . ,p„) be a vector of integers. Then the following are 
equivalent: 

(1) p is an unsealed pseudo-codeword. 

(2) p G /C and Hp^ = G F^. 

In other words, the unsealed pseudo-codewords are precisely those integer vectors in 
the fundamental cone which reduce modulo 2 to codewords. 
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Proof. Suppose that p is an unsealed pseudo-codeword. Then there is an M-cover 
T of the Tanner graph T associated to H and a codeword 

c = (c(i,i): • ■ • :C(i,Af)j • ■ • ; C(„^i): . . . :C(„^m)) 

in C, the code associated to T, such that, for each i, exactly pi of the coordinates 
C(i,fc), 1 < < M, are 1. Let H — {h(^j^iy(^i^k)): where l<j<r, l<i<n, 
1 < k < M, and 1 < / < M, be the parity-check matrix of the code C associated to 
T. For each i and j, let aji be as in Lemma l3. II so that = 1 if and only 

if hji = 1 and k — aji{l). Then the equation = G Fj^^ implies that, in F2, 
we have for each j and 

n AI n 

(4.3) = ^ ^ /io-,o,(»,fc)C(4,fc) = X! ^J'H^^'^j'in)- 

i = l k=l 4=1 

We shall use this observation to prove that p G /C and that Hp^ = G F2. 

We first show that p G /C. Clearly inequalities H4.1|l hold for 1/ = p, and we must 
show that inequalities (|4.2() do as well. Thus, we must show that we have 

l<k<Mi'^i l<k<M 

for each i and j. Certainly (|4.4II holds if hji = or if C(i jt) — for all k. So assume 
hji = 1 and not all C(^i k) are zero. For each k with C(^i k) — 1, set Ik '■— cyji^{k). 
Then we have by H4.3|l that the integer sum 

n 
i' = l 

is even. Hence, for each k with C(^i^k) = I7 there is at least one value of i' ^ i such 
that hjii = C(i/ g.^., (jj^)) = 1. Note that as k varies, the indices (i', ajii{lk)) are all 
distinct. Thus (|4.4(l holds and so p G /C. 

To see that Hp^ = G F2, sum (|4.3|l over / to get that for each j, we have 

M n 
1=1 1=1 

in F2. After interchanging the summations over I and i, we may use the fact that 
cFji is a permutation and substitute the summation variable ^ by A; = to get 

n AI n 

= XI '^j' "^(^.fc) ^iiPi 

4=1 fc = l 4=1 

in F2, i.e., Hp'^ = G F^. 

Conversely, suppose p = (pi,...,p„) G /C and Hp^ = G F2. Let T be 
the Tanner graph associated to H, and label the bit nodes of T as xi, . . . , to 
correspond to the n columns of H. For 1 < j < n, set M^. = pi. For each edge e of 
T, there is a unique i, 1 < « < ri, such that e is incident to Xi] set me := Pi for this 
value oil. Then hypothesis (HQ of Theorem l3.3l is satisfied. That hypothesis (HlSJ 
is satisfied follows directly from the fact that Hp^ = G Fj. The fact that p G /C 
says that hypothesis (H|21l holds. Thus Theorcm l3 . 31 applies and we have a finite M- 
cover T of T for some M >\ and a union A := Ai U • • • U Ap of backtrackless paths 
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on T starting and ending at bit nodes of T and satisfying conditions (CQ)-(C01) 
of that theorem. Label the bit nodes of T as X{i^k) for 1 < i < n and 1 < fc < M, 
and let 



C — (C(l,l): • • ■ :C(i,Af), • ■ • , C(n,iy-C{n,M)) ^ ^ 



iM 



2 

be the vector given by the rule C(i.fc) = 1 if and only if a;(i^fc) occurs in A, i.e., if 
and only if 1 < A: < p.;. Then conditions (C0-(C21) ensure that c is a codeword 
in the code corresponding to T . Finally, we see that the unsealed pseudo-codeword 
associated to c is precisely p. □ 

Corollary 4.5. Every normalized pseudo- codeword lies in the fundamental cone. 

Proof. Let lj — -p-p be a normalized pseudo-codeword, where p is an unsealed 
pseudo-codeword coming from a codeword in the code corresponding to some M- 
cover. Then p G /C by Theorem 14 . 41 and so a; e /C since /C is a cone. □ 

Theorem 4.6. The rays through the pseudo- codewords are dense in the fundamen- 
tal cone. More precisely, let C he a binary linear code with parity-check matrix H , 
Tanner graph T = T{H) and fundamental cone fC = JC{H), and let u £ K,{H). 
Then for any e > 0, there is a pseudo-codeword p such that | |ap — fW < e for some 
a > 0. 

Proof. Let n be the length of C, so that v ~ {i^i, . . . , Vn) G K". Choose /3 £ K 
sufficiently large so that the vector p := p(/3) = (pi, . . . ,p„), where Pi = 2[/3fi], 
satisfies ||ap — I'll < e for some a > 0. For example, if n = 1 we may take P = j 
and a = |. 

We claim p £ A^. Certainly pi > for 1 < i < n, and we must show that 
inequalities 1)4.2(1 hold for p. Since i/ £ JC{H) by assumption, we know that in- 
equalities ((4.2(1 hold for i/. Multiplying both sides by f3 and taking ceilings yields, 
for all i and j. 



Since each pi is even, we have i/p^ = £ Fj, and so p is an unsealed pseudo- 
codeword by Theorem 14.41 □ 

5. Cycle Codes 

A binary linear code C defined by a parity-check matrix H is called a cycle code 
if all bit nodes in the associated Tanner graph T{H) have degree 2. The pseudo- 
codewords of cycle codes were studied by the authors in j^. In this section, we 
review the results of that paper. In Sectional we will show that every LDPC code 
can be realized as a punctured subcode of a cycle code, and use that relationship 
to give a nice characterization of the pseudo-codewords in the general case. 

The pseudo-codewords of cycle codes can be described in terms of the monomials 
appearing in the edge zeta function 0], |H| of the normal graph ^ of the code. We 
begin with some definitions. 

Definition 5.1 (O). Let C be a cycle code with parity check matrix H and 
associated Tanner graph T. Let X be the set of bit nodes of T and let F be the 
set of check nodes of T. The normal graph of T (or of H , or of C) is the graph 
N = N{T) = N{H) with vertex set F and edge set {d{x) \x^X}. 
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Example 5.2. Since all the bit nodes of the Tanner graph of the code C from 
Example 12.41 have degree 2, C is a cycle code. The normal graph C is formed by 
simply dropping the bit nodes from the Tanner graph. It is shown in Figure|21 The 
edge d{xi) is labeled by e^. 

Definition 5.3. Let G = {V,E) be a graph. Fix an ordering of the edges, so 
that we have E — {ei, . . . , e„}. A sequence of edges (e^i , . . . ,ei^) of G is called a 
cycle if the edges e^^. can be directed so that e^^ terminates where ei^_^-^^ begins for 
1 < s < fc — 1 and Cii^ terminates where e^^ begins, i.e., a cycle is a path which 
starts and ends at the same vertex. We say the cycle is edge-simple if Cij ^ e^j for 
j ^ I. We say the cycle is simple if each vertex of G is involved in at most two 
of the edges , . . . , e^j. ; note that every simple cycle is necessarily edge-simple. 
The characteristic vector of the edge-simple cycle (e^j, . . . , e^j.) on G is the binary 
vector of length n whose t^^ coordinate is 1 if and only if et appears as some Ci- . 

The significance of the term cycle code is illustrated by the following Lemma, 
which follows from Euler's Theorem Th. 1.2.26]. 

Lemma 5.4 ([Sj)- 

(1) Let C be a cycle code with Tanner graph T and normal graph N = N{T). 
Then C is precisely the code spanned by the characteristic vectors of the 
simple cycles in N . 

(2) Let G — (y, E) be any graph and let C be the code spanned by the char- 
acteristic vectors of the simple cycles in G. Let T — T{G) be the bipartite 
graph described as follows: The vertex set of T is E U V . If e Cz E and 
V £V, then the pair {e,v} is an edge of T if and only if e is incident to v 
in T. Then the degree in T of every vertex e ^ E is 2, and C is precisely 
the cycle code with Tanner graph T . 

In light of Lemma 15.41 if G is any graph, we call the code spanned by the 
characteristic vectors of the simple cycles in G the cycle code on G. In order to 
define the edge zeta function of N ^ we need some more definitions. 

Definition 5.5. Let F = [ci^ , . . . , 6;^.) be a cycle in a graph X . We say F is tailless 
if 7^ . We say F is primitive if there is no cycle Q on X such that F = O'' 



h h 




Figure 3. The normal graph of the code G from Example 12.41 as 
described in Example 15.21 
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with r > 2, i.e., such that T is obtained by following 8 a total of r times. We say 
that the cycle A = (ejj, . . . ,6^^,) is equivalent to F if there is some integer t such 
that Cj^ = ej^^^ for all s, where indices are taken modulo k. 

It is easy to check that any simple cycle is primitive, backtrackless and tailless, 
and that the notion of equivalence given in Definition 15.51 defines an equivalence 
relation on primitive, backtrackless, tailless cycles. Also, it is clear that, up to 
equivalence, a cycle is backtrackless if and only if it is tailless. The edge zeta 
function of a graph is a way to enumerate all equivalence classes of primitive, 
backtrackless cycles and combinations thereof. 

Definition 5.6. Let F be a path in a graph X with edge set E = {ei, . . . , e„}; 

write F = (e^j^, . . . , 6^^,) to indicate that F begins with the edge e^^ and ends with 
the edge e^^. The monomial of F is given by (?(F) := Ui-^ ■ ■ -ui^, where the UiS 
are indeterminants. The edge zeta function of X is defined to be the power series 
Cx(wi, ■ • ■ ,u„) G . . . ,u„]] given by 

Cx(ui, ■ • ■ = Y\ {l-g{T)) , 

[T\eA{X) 

where A{X) is the collection of equivalence classes of backtrackless, tailless, primi- 
tive cycles in X . 

Although the product in the definition of the edge zeta function is, in general, 
infinite, the edge zeta function is a rational function To make this precise, we 
must define the directed edge matrix of a graph. 

Definition 5.7. [5] Let X = {V, E) be a graph with edge set E = {ei, . . . , e„}. A 
directed graph X derived from X is any pair {V^ E) where E — {ei, . . . , e2n} is a 
collection of ordered pairs of elements of V such that, for 1 < i < n, if = {w, w} 
then {cj, Cn+i} = {{v, w), {w, v)}. (Thus we may think of X as having two directed 
edges, with opposite directions, for every edge of X.) The directed edge matrix of 
X is the 2n x 2n matrix M = (my) with 

{1, if feeds into e, to form a backtrackless path 
0, otherwise. 

The directed edge matrix of any directed graph A" of X is called a directed edge 
matrix of X. 

Theorem 5.8. 8 The edge zeta function Cx(ui, . . . , Un) is a rational function. 
More precisely, for any directed edge matrix M of X , we have 

Cjf(ui, . . . = det(/ - UM) = det(/ - MU) 

where I is the identity matrix of size 2n and U = diag(ui, . . . , u„, Ui, . . . , m„) is a 
diagonal matrix of indeterminants. 

The next theorem gives the connection between the pseudo-codewords of a cycle 
code and the edge zeta function of the normal graph of the code. Its proof was 
sketched in [SI, and it is generalized in Theorem 15 . 31 below to the case in which all 
bit nodes of the Tanner graph have (arbitrary) even degree. 

Theorem 5.9 (|5j). Let C be a cycle code defined by a parity-check matrix H 
having normal graph N :— N{H), let n — n(N) be the number of edges of N , 
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and let ^jv ■— Cn{ui, . . . , u„) be the edge zeta function of N . Let pi, . . . ,p„ be 
nonnegative integers. Then the following are equivalent: 

(1) u^^ . . . has nonzero coefficient in C,^ ■ 

(2) (pi, . . . ,p„) is an unsealed pseudo-codeword for C with respect to the Tanner 
graph T — T(H). 

(3) There is a backtrackless tailless cycle in N which uses the i"* edge exactly 
Pi times for 1 < i < N . 

Definition 5.10. The exponent vector of the monomial u^^ . . .u^ is the vector 
(pi, . . . ,j3„) e Nq of the exponents of the monomial. 

Example 5.11. It is shown in |S] that the edge zeta function of N , where N is the 
normal graph of the code C given in Example 15. 21 satisfies 

Ca'(ui, . . . , U7) "'^ = 1 — 2M1M2U3 + u\u\v^ — 2ur,UQU7 + 4:UiU2U3U5UqU7 

— 2uiU2U^Uc^UqU-j + UiU2U^U^UqU-j + 4uiU2U3M4U5ltgUY — 4U]^U2'^3%'"5'"6^7- 

Expanding out the Taylor series, we get the first several terms of Ca?: 

CAf(Ml, . . . , U7) = 1 + 2U1U2U3 + iufulul + 2U5U6W7 + 4M1M2U3M5U6U7 

+ 6UiU2UjUzUqUj + 4:UiU2U3U^UzUqUi + 12UiU2U^U^U5UqUt + SUt^UgUj 

222 222222 2222 

+ 6uiU2U3U^UqUj + 9UiU2U^Ui^UqUj + 12uiU2U3U^Ui^UqUj 

+ o/^ 2 2 2 2 2 2 2 I 
dbUiU2U3U^Ur^UQUj + • • • . 

The exponent vectors of the first several monomials appearing in are 

(0,0,0,0,0,0,0), (1,1,1,0,0,0,0), (2,2,2,0,0,0,0), (0,0,0,0,1,1,1), (1,1,1,0,1,1,1), 
(2,2,2,0,1,1,1), (1,1,1,2,1,1,1), (2,2,2,2,1,1,1), (0,0,0,0,2,2,2), (1,1,1,0,2,2,2), 
(2,2,2,0,2,2,2), (1,1,1,2,2,2,2), (2,2,2,2,2,2,2), ... . 

Note that most of these lie within the integer span of the codewords in C; for 

example, 

(1,1,1,0,2,2,2) = (1,1, 1,0, 0,0,0) + 2(0, 0,0, 0,1, 1,1). 

The exceptions thus far are 

(1,1, 1,2, 1,1,1), (2, 2, 2, 2, 1,1,1), (1,1, 1,2, 2, 2, 2), (2, 2,2, 2, 2, 2, 2). 

The first of these exceptions is exactly the unsealed pseudo-codeword of the code- 
word a — (1:0, 1:0, 1:0, 1:1, 1:0, 1:0, 1:0) on the double-cover T of the Tanner graph 
T in Example 12. 61 and the rest lie within the integer span of this pseudo-codeword 
along with the codewords. A 

The following corollary gives an algebraic description of the fundamental cone 
in the cycle code case. 

Corollary 5.12 (0). The Newton polyhedron of , ">■■£■, the polyhedron spanned 
by the exponent vectors of the monomials appearing with nonzero coefficient in the 
Taylor series expansion of Cn, is exactly the fundamental cone IC{H) of the code 
C. 
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6. The General Case 

In Sectional we saw that if C is a cycle code on a graph N, then the edge zeta 
function Cat of the graph N has the property that the monomials appearing with 
nonzero coefficient in the power series expansion of Cat correspond exactly to the 
pseudo-codewords of C. It is a natural goal to find such a function for more general 
LDPC codes. In this section, we make some progress toward this goal. 

A Tanner graph is called bit-even if all the bit nodes in it have even degree. Let 
Hq be a binary matrix and let Tq = T{Hq) be the associated Tanner graph. If Tq is 
not bit-even, let H be the matrix obtained from by duplicating each row of Hq. 
Then the Tanner graph T corresponding to H is obtained from Tq by duplicating 
all the check nodes and drawing an edge between a bit node and a copy of a check 
node if and only if there was an edge between the bit node and the original check 
node, so that T is bit-even. Certainly, Hq and H (i.e., Tq and T) describe the same 
code. Moreover, it is clear from Definition l4.1l that they have the same fundamental 
cone, and hence, by Theorem 14.41 the same pseudo-codewords. Thus, to describe 
the pseudo-codewords which arise when we use Tq to decode, we may equivalently 
describe the pseudo-codewords which would arise from the (redundant) parity check 
matrix giving rise to the Tanner graph T . Our next task, therefore, is to describe 
the pseudo-codewords associated to bit-even Tanner graphs. 

Remark 6.1. Given a Tanner graph Tqi the procedure described above of dupli- 
cating all check nodes will always produce a bit-even Tanner graph with the same 
fundamental cone (and hence the same pseudo-codewords) as our original Tanner 
graph. In some cases, it may be possible to produce a Tanner graph with these 
properties by duplicating only some of the check nodes. This "smaller" Tanner 
graph may be desirable in practice. 

We first describe the codewords of a code with bit-even Tanner graph T in terms 
of cycles on T. 

Proposition 6.2. Let C he a binary linear code and let T he a Tanner graph 
associated to C . Assume that T is bit-even. Then the codewords in C correspond 
to disjoint unions of edge-simple cycles on T such that at each bit node x of T , 
either all or none of the edges incident to x occur. 

Proof. Let X be the set of bit nodes of T and let F be the set of check nodes of T. 
Fix a binary vector c = {cx)xex- We know that c is a codeword in C if and only 
if, when we assign the value Cx to every edge incident to the bit node x, the binary 
sum of the values of the edges incident to each check node is 0. In other words, 
associate to c the subgraph T(c) which has as left vertices those x € X such that 
Ca, = 1, as right vertices those f € F which are joined by an edge in T to at least 
one of these x, and as edges all the edges in T between these x and these /. Then c 
is a codeword if and only if the degree in T(c) of each / is even. Since the degree of 
each X is even by assumption, we see that c is a codeword if and only if the degree 
of every vertex in r(c) is even. The result now follows immediately from Euler's 
Theorem Th. 1.2.26]. □ 

Using Proposition 16.21 we may view a binary linear code with bit-even Tanner 
graph as a punctured subcode of a cycle code as follows: Let C C F2 be a binary 
linear code with associated Tanner graph T, and assume that T is bit-even. Let C 
be the cycle code on T. Let xi, . . . , a;„ be the bit nodes of T, and label the edges 
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of T (which correspond to the coordinates of C) so that the edges incident to the 
bit node Xi are labeled e(i^i), . . . , e(^i^di), where di is the (even) degree of Xi. Let 
N = Yll=i di be the number of edges in T and define : by 

0(C(1,1): • • • : C(i,di), . . . , C(n,l)'-- ■ ■ ■C(n,d„)) = (C(l,l), • ■ • , C(„_i)), 

i.e., (j) picks off the first coordinate in each of the n blocks corresponding to the n 
bit nodes Xj. Let C be the subcode of C consisting of codewords 

(c(i4): ■ • • :c(i,di), ■ ■ • ' C(„,i): . . . :c(„^(j„)) 

where C(^ij) = C(^i,i) for 1 < i < n and 1 < j < di. Then the restriction of to C 
is an isomorphism to C by Proposition 16. 21 In other words, C may be regarded as 
the code obtained by puncturing the subcode C of C on the positions with 
2 < j < dj, for l<i<n. 

Next, we describe the pseudo-codewords of a code with respect to a bit-even 
Tanner graph T in terms of T. 

Theorem 6.3. Let C be a binary linear code with associated Tanner graph T. 
Assume that T is bit-even. Then the unsealed pseudo-codewords of C with respect 
to T correspond to disjoint unions of backtrackless tailless cycles on T in which all 
edges incident to any given bit node occur the same number of times. 

Proof. We first set up some notation. Let H be the parity-check matrix for C 
associated to T and let K. — IC{H) be the fundamental cone. Let n be the length 
of C, let xi, . . . , Xn be the bit nodes of T, and assume T has r check nodes so that 
if is an r X n matrix. 

Assume that p = (pi, . . . ,p„) is an unsealed pseudo-codeword of C with respect 
to the Tanner graph T. Then there is a codeword c in the code corresponding to 
some finite cover tt : T ^ T oiT such that the unsealed pseudo-codeword associated 
to c is p. Since T is bit-even, we have by Proposition |^21 that c corresponds to a 
union A of edge-simple cycles on T such that at each bit node x of T, either all or 
none of the edges incident to x occur. Taking 7r(A), we get a union of backtrackless 
tailless cycles on T in which all edges incident to any given bit node occur the same 
number of times, as desired. 

Conversely, suppose we are given a union A of backtrackless tailless cycles on T 
in which all edges incident to any given bit node Xi occur the same number, say pi, 
of times. Let p = {pi, . . . ,Pn)- We know that Hp^ = G Fg since A is a union of 
cycles, and we need to show that p G A^. Certainly equations H4.1(l hold for u — p. 
The expression hjiPi counts how many edges in A go between the bit node Xi and 
the check node fj. Since each A^ is backtrackless and tailless, every time A^ goes 
from Xi to fj, it must continue to some x[ ^ Xi. This means that the number of 
edges in each A^ which go between Xi and /j is at most the number of edges which 
go between and all Xi' with i' ^ i. Thus 

h■j^>P^> > hjiPi 

i' 

for each i and j, i.e., equations 1)4. 2|l hold. Hence p G /C and so, by Theorem 14.41 
p is a pseudo-codeword. □ 

Using Theorem 16.31 we can describe the pseudo-codewords of a binary linear 
code C with respect to a bit-even Tanner graph T in terms of the exponent vectors 
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of the monomials appearing with nonzero coefficient in a certain power series. We 
saw above that C is equal to 4'{C'), where C" is a subcode of the cycle code C on 
C, and (j) is the map which punctures on all positions with 2 < j < di for 

1 < i < n. We also have a map on the power series rings, which we will again write 
as (j): 

(j) : Z[[U(i,i): ■ • • : ■ ■ ■ ,M(n,i): ■ • ■ ■U{n,d^)\] ^ ^[[""l, ■ • ■ ,Mn]]- 

This map 4> is induced by 



Let 



Mj if j = 1 
1 otherwise. 



C = Ct(u(1,1), • ■ • , "(l.di), • ■ • , "(n,!), ■ • ■ , 

be the edge zeta function of T, so that unsealed pseudo-codewords of C with respect 
to T are precisely the exponent vectors of the monomials appearing with nonzero 
coefficient in the power series expansion of C by Theorem 15.91 By Theorem 16.31 
the unsealed pseudo-codewords of C with respect to T are the unsealed pseudo- 
codewords of C with respect to T in which all edges incident to any given bit node 
of T occur the same number of times. If we let C,' be the power series obtained from 
C by picking off those terms with monomials of the form 



n 



U,. .^' 



1 < i < TO 

1 < j < d. 



with P[ij) — P{i,i) for 1 < j < di, then the unsealed pseudo-codewords of C with 
respect to T are precisely the exponent vectors of the monomials appearing with 
nonzero coefficient in the power series (f>{C)- 

The above discussion is summarized in the following theorem: 

Theorem 6.4. Let C be a binary linear code with Tanner graph T , let T he a 
bit-even Tanner graph obtained by duplicating some or all of the check nodes of T , 
and let C be the cycle code on T . Then C is a punctured subcode of C. More- 



over, 



after choosing a suitable labeling of the di edges of T , where di is the 



(even) degree of the i*'' hit node of T , the unsealed pseudo- codewords of C with re- 
spect to T are precisely those vectors {pi, . . . ,p„) of nonnegative integers such that 
'^('ij) '^PPfiCj^'S with nonzero coefficient in the power series expansion of 

1 < i < n 

1 <J <d, 

the edge zeta function of T . 

Remark 6.5. When C is a cycle code on a graph N , we saw in Section 5 that the 
associated zeta function C,iq is a rational function whose Taylor series expansion 
records all pseudo-codewords of C. For a general LDPC code C with associated 
Tanner graph T , it would be very interesting to find a rational function, arising 
combinatorially, such that the monomials occurring in its Taylor series expansion 
are precisely those in 0(C') constructed above. 
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